BIOTECHNOLOGY



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: /4^7^? " 

Date Processed by STIC: 02/13/2002

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS:
http://www.uspto.gov/web/offices/pac/checker

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202
Or

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202

Revised 01/29/2002

RAW SEQUENCE LISTING DATE: 02/13/2002

PATENT APPLICATION: US/10/048,146 TIME: 10:07:52



Input Set : A:\EP.txt

Output Set: N:\CRF3\02132002\J048146.raw

Does Not Comply
Corrected Diskette Needed

3 <110> APPLICANT: The Government of the United States of America as represented by the
4 Secretary of the Department of Health and Human Services, Centers for Disease
5 Control and Prevention
6 Tsang, Victor
7 Greene, Ryan
8 Wilkins, Patricia
9 Hancock, Kathy

11 <120> TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DETECTING LARVAL TAENIA SOLIUM

13 <130> FILE REFERENCE: 6395-62068
C--> 15 <140> CURRENT APPLICATION NUMBER: US/10/048,146
C--> 15 <141> CURRENT FILING DATE: 2002-01-25
15 <150> PRIOR APPLICATION NUMBER: US 60/147,318
16 <151> PRIOR FILING DATE: 1999-08-03

18 <150> PRIOR APPLICATION NUMBER: PCT/US00/21173
19 <151> PRIOR FILING DATE: 2000-08-03

21 <160> NUMBER OF SEQ ID NOS: 9

23 <170> SOFTWARE: PatentIn version 3.1

25 <210> SEQ ID NO: 1
26 <211> LENGTH: 2153
27 <212> TYPE: DNA
28 <213> ORGANISM: Taenia solium

30 <220> FEATURE:
31 <221> NAME/KEY: CDS
32 <222> LOCATION: (145)..(531)
33 <223> OTHER INFORMATION:

36 <400> SEQUENCE: 1

37 ctgcagtgaa gttgacaagt agttgaccat ttacggaaca tcaatggagg acactttggt 60
39 agggaaagca tacgataaac ataaaccaat gctggttata taagagacga tctcggctac 120

41 acttgtaact gaacaacctg taga atg cgt gcc tac att gtg ctt ctc gct 171
42 Met Arg Ala Tyr Ile Val Leu Leu Ala
43 1 5

45 ctc act gtt ttc gta gtg acg gtg tcg gcc gag tgg gtg ccc att tcg 219
46 Leu Thr Val Phe Val Val Thr Val Ser Ala Glu Trp Val Pro Ile Ser
47 10 15 20 25

49 agt gtc cac ata gcc tca tgc aaa agc cac tac atg ttc caa tta aaa 267
50 Ser Val His Ile Ala Ser Cys Lys Ser His Tyr Met Phe Gln Leu Lys
51 30 35 40

53 cgc ttt ttt gcc ttt agg aaa aac aaa ccg aaa gat gtt gca aat agt 315
54 Arg Phe Phe Ala Phe Arg Lys Asn Lys Pro Lys Asp Val Ala Asn Ser
55 45 50 55

57 acg aaa aaa ggg ata gaa tat gtc cac gaa ttc ttc cac gaa gac ccg 363
58 Thr Lys Lys Gly Ile Glu Tyr Val His Glu Phe Phe His Glu Asp Pro
59 60 65 70

61 att ggt aaa caa att gct caa ctc gca aag gaa tgg aag gaa gca atg 411
62 Ile Gly Lys Gln Ile Ala Gln Leu Ala Lys Glu Trp Lys Glu Ala Met
63 75 80 85

65 ttg gaa ggt agg ttt tgg tgt ttt ctg tca gaa gaa aat tat cta ttc 459
66 Leu Glu Gly Arg Phe Trp Cys Phe Leu Ser Glu Glu Asn Tyr Leu Phe
67 90 95 100 105

69 att cat cta gac aaa ggc aaa ata cgg acg tca ctg gtt gag cac tgc 507
70 Ile His Leu Asp Lys Gly Lys Ile Arg Thr Ser Leu Val Glu His Cys
71 110 115 120

73 aaa ggt cct aag aaa aaa act gct taacttgtca actttcatgc gttcttctct 561
74 Lys Gly Pro Lys Lys Lys Thr Ala
75 125

77 tcactaataa atgctcatta ataagaaagc tgccttttgc aagatcaacg agggccatag 621
79 actgtgaggg ttatagccta aggttatggg gtgaaatgag ataggaattg agcatttgag 681
81 aagttactaa tttaaattga aagccgcatt tcttctgcaa ttgacgtgtg atggttagcg 741
83 aaaccaagtg aagcacgacc tcttgagtcg tttcaacagc cgccagtggt ttcaccagtg 801
85 gcttcaccag tgggtagact ggtttgtcac acatgcgagg tacggtcaga gggctaacag 861
87 gtgtggtgga ggggccaaca cgtgtaagac aagcagttcc ccttctctgt cgtgaggcac 921
89 actcagcacc cacctcgttt acttctccct tgacgactgt aatgcatttg gggtcaccat 981
91 gcccccgcca agttgaaggc actgatgaca tttgtaccat atcaccgata agtattaact 1041
93 cttccacttc ccagattttg aggtcaggcg atcctactga ctcggtgtag ccccatggtg 1101
95 gtccatgctc tgcaccattc gctgttcagt ggagcatcca cctagacggc caaccaatct 1161
97 cgcctccctt ctcctgtgct caagatgtgc gtcggtgaga tttggagggt ctgatcacca 1221
99 tactaaccac gtaggtttca tcatctctaa gaagcaccac ttcttgaggt cgcattgtgt 1281
101 accaccagcc ggtgtaatca agagtgactt tcgcgtcacc cctaagaagg ctatagatct 1341
103 gcaagtcagc gcaatagctt cagccatgct gactaaaatg tgtaagggac cagtagctct 1401
105 agcccaacac aagtggagct aataatgggc ttccccagat acatgaatcc caaatcggtg 1461
107 agcatgggcc atgaatatgg ccttctgagt cttccttgaa tgcaaacgaa ggcatagcac 1521
109 gagggtagga tgagtgtaca gaaaacagcg aggcaacgaa tctactggca tggccctgat 1581
111 gccaccccgc ccagctaggg tagtttggcc acctcagtcc ttaatcgaat gcggcagtca 1641
113 gaacaaacaa agtattacat agccacactc ttcttttgag cgtcgtcctc gacgctcctt 1701
115 tcgacacacc tcccgcatca gccaccacaa agtaatcagt actggggaga cacccacgag 1761
117 ctaaccgtgc cagtcatgga aaatttgacg gcaactgagg agatgcctga ccccctttgg 1821
119 cagttcgaat gctgcccgtg gtcaaactcc tgcatcagcc atcacctacg attcaaacat 1881
121 cctagtcgcc aaattttcgt gaaccctcta aaattttcgt gcactctcaa gacacttcca 1941
123 actgacttag agctttttca tttggtgaga acacgtaaaa gcttcaagta aacaacaggc 2001
125 aacgatttca ctttgatgct ctcaccatca attctcttgt atgtgccacc accttaaacc 2061
127 ctccctgacc acttccactc tctctctctc cctaaataac aacacttgga agcatgaatg 2121
129 gtgtctgtca aagttacacc cctagactgc ag 2153

132 <210> SEQ ID NO: 2
133 <211> LENGTH: 129
134 <212> TYPE: PRT
135 <213> ORGANISM: Taenia solium

137 <400> SEQUENCE: 2
139 Met Arg Ala Tyr Ile Val Leu Leu Ala Leu Thr Val Phe Val Val Thr
140 1 5 10 15
143 Val Ser Ala Glu Trp Val Pro Ile Ser Ser Val His Ile Ala Ser Cys
144 20 25 30
147 Lys Ser His Tyr Met Phe Gln Leu Lys Arg Phe Phe Ala Phe Arg Lys
148 35 40 45
151 Asn Lys Pro Lys Asp Val Ala Asn Ser Thr Lys Lys Gly Ile Glu Tyr
152 50 55 60
155 Val His Glu Phe Phe His Glu Asp Pro Ile Gly Lys Gln Ile Ala Gln
156 65 70 75 80
159 Leu Ala Lys Glu Trp Lys Glu Ala Met Leu Glu Gly Arg Phe Trp Cys
160 85 90 95
163 Phe Leu Ser Glu Glu Asn Tyr Leu Phe Ile His Leu Asp Lys Gly Lys
164 100 105 110
167 Ile Arg Thr Ser Leu Val Glu His Cys Lys Gly Pro Lys Lys Lys Thr
168 115 120 125
171 Ala

175 <210> SEQ ID NO: 3
176 <211> LENGTH: 298
177 <212> TYPE: DNA
178 <213> ORGANISM: Taenia solium

180 <220> FEATURE:
181 <221> NAME/KEY: CDS
182 <222> LOCATION: (3)..(224)
183 <223> OTHER INFORMATION:

186 <400> SEQUENCE: 3
187 ta ttc gta gtg gcg gtt tcg gcc gag aaa aac aaa ccg aag tgt gat 47
188 Phe Val Val Ala Val Ser Ala Glu Lys Asn Lys Pro Lys Cys Asp
189 1 5 10 15

191 gca aat agt act aag aaa gag ata gaa tat atc cac aat tgg ttt ttc 95
192 Ala Asn Ser Thr Lys Lys Glu Ile Glu Tyr Ile His Asn Trp Phe Phe
193 20 25 30

195 cat gat gac ccg att gga aaa caa att gct caa ctc gca aag gac tgg 143
196 His Asp Asp Pro Ile Gly Lys Gln Ile Ala Gln Leu Ala Lys Asp Trp
197 35 40 45

199 aat gaa aca gtg cag gaa gcc aaa ggc aaa ttt tgg gcg tca ctg gct 191
200 Asn Glu Thr Val Gln Glu Ala Lys Gly Lys Phe Trp Ala Ser Leu Ala
201 50 55 60

203 gag tac tgc aga ggt ctg aag aac aaa act gct taacttgtca actttcatgc 244
204 Glu Tyr Cys Arg Gly Leu Lys Asn Lys Thr Ala
205 65 70

207 gttcttctct tcaccaataa atgctgatta acaagaaaaa aaaaaaaaaa aaaa 298

210 <210> SEQ ID NO: 4
211 <211> LENGTH: 74
212 <212> TYPE: PRT
213 <213> ORGANISM: Taenia solium

215 <400> SEQUENCE: 4
217 Phe Val Val Ala Val Ser Ala Glu Lys Asn Lys Pro Lys Cys Asp Ala
218 1 5 10 15
221 Asn Ser Thr Lys Lys Glu Ile Glu Tyr Ile His Asn Trp Phe Phe His
222 20 25 30
225 Asp Asp Pro Ile Gly Lys Gln Ile Ala Gln Leu Ala Lys Asp Trp Asn
226 35 40 45
229 Glu Thr Val Gln Glu Ala Lys Gly Lys Phe Trp Ala Ser Leu Ala Glu
230 50 55 60
233 Tyr Cys Arg Gly Leu Lys Asn Lys Thr Ala
234 65 70

237 <210> SEQ ID NO: 5
238 <211> LENGTH: 294
239 <212> TYPE: DNA
240 <213> ORGANISM: Taenia solium

242 <220> FEATURE:
243 <221> NAME/KEY: CDS
244 <222> LOCATION: (3)..(221)
245 <223> OTHER INFORMATION:

248 <400> SEQUENCE: 5
249 tt ttc gta gtg gcg gtg tcg gcc gag gaa act aaa cca gag gac gtg 47
250 Phe Val Val Ala Val Ser Ala Glu Glu Thr Lys Pro Glu Asp Val
251 1 5 10 15

253 gta aag aat att aag aaa ggg atg gaa gtt gtc tac aaa ttt ttc tac 95
254 Val Lys Asn Ile Lys Lys Gly Met Glu Val Val Tyr Lys Phe Phe Tyr
255 20 25 30

257 gaa gac ccg ttg gga aag aaa ata gct caa ctc gca aag gac tgg aag 143
258 Glu Asp Pro Leu Gly Lys Lys Ile Ala Gln Leu Ala Lys Asp Trp Lys
259 35 40 45

261 gaa gca atg ttg gaa gcc aga agc aaa gtg cgg gcg tca ctg gct gag 191
262 Glu Ala Met Leu Glu Ala Arg Ser Lys Val Arg Ala Ser Leu Ala Glu
263 50 55 60

265 tac atc aga ggt ctc aag aac gaa gct gct taacttgtca actttcatgc 241
266 Tyr Ile Arg Gly Leu Lys Asn Glu Ala Ala
267 65 70

269 gttcttctct tcactaataa atgctcatta ataagaaaaa aaaaaaaaaa aaa 294

272 <210> SEQ ID NO: 6
273 <211> LENGTH: 73
274 <212> TYPE: PRT
275 <213> ORGANISM: Taenia solium

277 <400> SEQUENCE: 6
279 Phe Val Val Ala Val Ser Ala Glu Glu Thr Lys Pro Glu Asp Val Val
280 1 5 10 15
283 Lys Asn Ile Lys Lys Gly Met Glu Val Val Tyr Lys Phe Phe Tyr Glu
284 20 25 30
287 Asp Pro Leu Gly Lys Lys Ile Ala Gln Leu Ala Lys Asp Trp Lys Glu
288 35 40 45
291 Ala Met Leu Glu Ala Arg Ser Lys Val Arg Ala Ser Leu Ala Glu Tyr
292 50 55 60
295 Ile Arg Gly Leu Lys Asn Glu Ala Ala
296 65 70

299 <210> SEQ ID NO: 7
300 <211> LENGTH: 6
301 <212> TYPE: PRT
302 <213> ORGANISM: Taenia solium

304 <400> SEQUENCE: 7
306 Ile Ala Gln Leu Ala Lys
307 1 5

310 <210> SEQ ID NO: 8
311 <211> LENGTH: 24
312 <212> TYPE: PRT
313 <213> ORGANISM: Taenia solium

315 <220> FEATURE:
316 <221> NAME/KEY: variant
317 <222> LOCATION: (7)..(8)
318 <223> OTHER INFORMATION: Amino acid at position 7 may also be valine

321 <220> FEATURE:
322 <221> NAME/KEY: site
323 <222> LOCATION: (21)..(22)
324 <223> OTHER INFORMATION: Asparagine at position 21 is an amino acid insertion

327 <220> FEATURE:
328 <221> NAME/KEY: variant
329 <222> LOCATION: (14)..(15)
330 <223> OTHER INFORMATION: Amino acid at position 14 may also be glycine

333 <220> FEATURE:
334 <221> NAME/KEY: variant
335 <222> LOCATION: (18)..(19)
336 <223> OTHER INFORMATION: Amino acid at position 18 may also be valine

339 <220> FEATURE:
340 <221> NAME/KEY: variant
341 <222> LOCATION: (19)..(20)
342 <223> OTHER INFORMATION: Amino acid at position 19 may also be histidine

345 <220> FEATURE:
346 <221> NAME/KEY: variant
347 <222> LOCATION: (20)..(21)
348 <223> OTHER INFORMATION: Amino acid at position 20 may also be arginine

351 <400> SEQUENCE: 8
353 Lys Asn Lys Pro Lys Asp Asp Ala Ala Ser Thr Lys Lys     Ile Glu
354 1               5                   10                  15
357 Tyr Ile Trp His Asn Phe Phe Phe
358             20

361 <210> SEQ ID NO: 9
362 <211> LENGTH: 13
363 <212> TYPE: PRT
364 <213> ORGANISM: Taenia solium

366 <220> FEATURE:
367 <221> NAME/KEY: variant
368 <222> LOCATION: (5)..(6)
369 <223> OTHER INFORMATION: Amino acid at position 5 may also be isoleucine

372 <220> FEATURE:
373 <221> NAME/KEY: variant
374 <222> LOCATION: (12)..(13)
375 <223> OTHER INFORMATION: Amino acid at position 12 may also be aspartic acid

378 <220> FEATURE:
379 <221> NAME/KEY: variant
380 <222> LOCATION: (7)..(9)

VERIFICATION SUMMARY DATE: 02/13/2002

PATENT APPLICATION: US/10/048,146 TIME: 10:07:53

Input Set : A:\EP.txt

Output Set: N:\CRF3\02132002\J048146.raw

L:15 M:270 C: Current Application Number differs, Replaced Current Application
L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date
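
The <222> LOCATION spans of the CDS features in SEQ ID NOS: 1, 3 and 5 can be cross-checked against the <211> LENGTH values of SEQ ID NOS: 2, 4 and 6. The sketch below is a minimal illustration in Python and is not part of the Checker program. The function names are hypothetical, and it assumes that each protein sequence is the translation of the preceding DNA sequence and that the stop codon lies outside the CDS span, as it appears to in this listing.

```python
# Hypothetical cross-check; not part of the USPTO Checker program.


def cds_codon_count(start: int, end: int) -> int:
    """Number of codons in an inclusive, 1-based CDS span such as (145)..(531)."""
    span = end - start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError(f"CDS span ({start})..({end}) is not a positive multiple of 3")
    return span // 3


# (DNA SEQ ID NO, CDS start, CDS end, protein SEQ ID NO, protein <211> LENGTH),
# values copied from the <222> and <211> fields of the listing.
# Pairing each DNA sequence with the next protein sequence is an assumption.
PAIRS = [
    (1, 145, 531, 2, 129),
    (3, 3, 224, 4, 74),
    (5, 3, 221, 6, 73),
]


def check_pairs(pairs):
    """Return (dna_id, protein_id, codons, declared_length, consistent) per pair."""
    results = []
    for dna_id, start, end, prot_id, declared in pairs:
        codons = cds_codon_count(start, end)
        results.append((dna_id, prot_id, codons, declared, codons == declared))
    return results


RESULTS = check_pairs(PAIRS)

if __name__ == "__main__":
    for dna_id, prot_id, codons, declared, ok in RESULTS:
        status = "consistent" if ok else "MISMATCH"
        print(f"SEQ ID NO: {dna_id} CDS gives {codons} codons; "
              f"SEQ ID NO: {prot_id} LENGTH {declared}: {status}")
```

For the values in this listing, each CDS span divides evenly into codons and the codon count equals the declared protein length, so a check of this kind would not flag the three DNA/protein pairs.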